Information Retrieval in Distributed Hypertexts

نویسندگان

  • Paul De Bra
  • Geert-Jan Houben
  • Yoram Kornatzky
  • Renier Post
چکیده

Hypertext is a generalization of the conventional linear text into a non-linear text formed by adding cross-reference and structural links between different pieces of text. A hypertext can be regarded as an extension of a textual database by adding a link structure among the different text objects it stores. We present a tool for finding information in a distributed hypertext such as the World-Wide Web (WWW). Such a hypertext is a distributed textual database in which text objects residing at (the same and) different sites have links to each other. In such a database retrieval is limited to the transfer of documents with a known name. Names of documents serve as links between different documents, and finding such references names is only possible by parsing documents that have embedded links to other documents. Full-text search in such hypertexts is not feasible because of the discrepancy between the large size of the hypertext and the relatively low bandwidth of the network. We present an information retrieval algorithm for distributed hypertexts, which does an incomplete search through a part of the hypertext. Heuristics determine the selection of the documents that are to be retrieved and searched. A prototype implementation for the WWW, on top of Mosaic for X, is being used by an increasingly large user base.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

TACHIR: A Tool for Automatic Construction of Hypertexts for Information Retrieval

The paper describes the design and implementation of TACHIR, a prototype tool for the automatic construction of hypertexts for Information Retrieval. TACHIR builds up automatically an IR hypertext, a hypertext to be used for information retrieval, from a document collection, using a methodology that makes use of a set of well known Information Retrieval techniques. The structure of the IR hyper...

متن کامل

Design and Implementation of a Tool for the Automatic Construction of Hypertexts for Information Retrieval

The paper describes the design and implementation of TACHIR, a tool for the automatic construction of hypertexts for Information Retrieval. Through the use of an authoring methodology employing a set of well known Information Retrieval techniques, TACHIR automatically builds up a hypertext from a document collection. The structure of the hypertext re ects a three level conceptual model that has...

متن کامل

Components of a Model of Context-Sensitive Hypertexts

On the background of rising Intranet applications the automatic generation of adaptable, context-sensitive hypertexts becomes more and more important [El-Beltagy et al., 2001]. This observation contradicts the literature on hypertext authoring, where Information Retrieval techniques prevail, which disregard any linguistic and context-theoretical underpinning. As a consequence, resulting hyperte...

متن کامل

Advanced Studies on Link Proposals and Knowledge Retrieval of Hypertexts with CBR

In this paper, several problems in associating hyperlinks to text and the diverse possibilities to overcome these problems are discussed. At the current stage, an important aspect is knowledge retrieval of hypertexts. Our advanced studies on hyperlink management focus mainly on a concept similar to Case-Based Reasoning (CBR) systems as a possibility for the automatic generation of links for hyp...

متن کامل

Probabilistic Argumentation Systems Applied to Information Retrieval

In this dissertation, a new logical model of information retrieval is developed and evaluated experimentally. This model is built on a general technique for uncertain reasoning called probabilistic argumentation systems (PAS), in which propositional logic and probability theory are combined to represent and handle uncertain knowledge, both in a symbolic and in a numerical way. The logical model...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994